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ABSTRACT 

Although videotape is the preferred method for 
recording behavior, there are many natural settings in which it is 
impractical or too intrusive. The ability to collect data in such 
settings and in real time has been enhanced by the advent of light, 
powerful laptop computers. This poster (with paper) describes a set 
of eight programs (developed in British Columbia) for collecting and 
analyzing f ocal~individual data (i.e., events and interactions) and 
scan-sample data (states) on such IBM-compatible machines. These 
computer programs can accommodate almost any user^specif ied set of 
codes, including codes for simultaneous events. Reliabilities 
(percent agreement and kappa), rates, frequencies, cumulative time 
(time-budget data), and conditional probabilities with z^scores and 
exact binomial probabilities can be generated. Binomial probabilities 
test the null hypothesis that the conditional probability is at 
chance levels. General contingency across the entire set of codes can 
be assessed by log-linear analysis, and specific patterns of 
contingent behavior can be identified automatically. Four figures 
illustrate the discussion, (Contains 3 references.) (Author/SLD) 
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Abstract 

Although videotape is the preferred method for recording behavior, there are many oatural settings in 
which it is impractical or too intrusive, ^le ability to collect observational data in such settings and in real time 
has been eniianced by the advent of light, powerful laptop computers. This poster describes a set of 8 programs 
for collecting and analyzing focal-individual dau (i^e., events and interactions) and scan-sample data (states) on 
such IBM-compatible machines. They can accommodate almost any user>specified set of codes, including codes 
for simultaneous events. Reliabilities (percent agreement and kappa), rates, frequencies, cumulative time (time- 
budget data) and conditional probabilities with z-scores and exact binomial probabilities can be generated. 
(Binomial probabilities test the null hypothesis that the condidonal probability is at chance levels.) General 
contingency across the entire set of codes can be assessed by log-linear analysis; and specific patterns of 
contingent behavior can be identified automadcally. 

-Because these programs were developed under funding from the Social Sciences and Humanities 
Research Council of danada, the programs and manual arc available without charge for purposes of 
research and teaching. Please contact William Roberts, Department of Psychology, C!ariboo College, P.O. Box 
3010, Kamloops, B.C. V2C 5N3, Canada. 
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Programs for che coUectiofl and analysis of observational data 
William Roberts 

Although videotape is the preferred method for recording befaavion there are many natural settings in 
which it is impractical or too intrusive. The ability to collect observational data in such settings and in real time 
has been eniianced by the advent of light, powerful, laptop computers. This poster describes a set of programs 
for collecting and analyzing such data on IBM-compatible machines. Because they were developed under 
fundmg from the Social Sciences and Hiunanities Research Council of Can?uia , the programs and manual are 
available without charge for purposes of research and teaching. 
Data entry 

The data-entry program (FOCAL) is designed to collect focal-individual data (Altmann, 1974) recorded 
in "inidator-action-target'' format Because the target's response can be entered in the following 'sentence*, 
interacdons between individuals can be coded as easily as the behavior of single individuals. FOCAL can also 
be used to record scan-sample data (that is, states rather than events and interactions). Users specify all codes as 
well as the length of the focal sample. Duradon is recorded automatically. See Figure 1. 



Insert Figure 1 about here 



During data collection, the screen displays the previous dozen data 'sentences' as well as die one being 
currendy entered. Errors in the current line can be corrected immediately; errors in earlier Imes can be flagged 
for correction after die session ends. 'Prompt screens* display behavior and person codes; the current focal 
person, the trial number, and dmc elapsed in the current focal sample are also displayed. Written notes can be 
entered at the end of each sample. 
Assessing reliability 

The reliability program (KAPPA) reports kappa* the standard error of kappa, and percent agxecmenr for 
the codes as a set. Percent agreement is also reported for each individual code, along with die four codes most 
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frequently confused witb it (See Figure 2/} Because observers coding in real time will necessarily be sllglitiy out 
of synchrony even when they agree about which codes to use, KAPPA allows the user to specify how much time 
discrepancy to tolerate. As shown in Figure 3, matching sequences and events can be madced for the two data 
files being compared, as an aid in training observers and understanding low reliabilities. Codes can be combined 
into new categories without altering the original data file. 

Insert Figure 2 about here 



Insert Figure 3 about here 



Data analysis 

TIMES reports frequency, rate, and time-budget data (cumulative time as a proportion of total time 
observed for continuously coded data* and frequency as a proportion of total events for scan-sample data). 
TIMES can handle simultaneous events. 

Sequential analyses can be carried out using LAG and SEARCH. For user-designated events, LAG 
reports cnnditional probabilities, their z-scores and exact binomial probabilities (as a test of the null hypothesis 
that the condidonal probability is at chance levels), and an index for determining if enough events have been 
, collected (Bakeman & (jottman, 1986). If requested, LAG will also produce output for a log-linear analysis, 
allowing contingency to be assessed for die taxonomy as a whole. (This output is in die form of an mstruction 
file that can be rtm directly by BMDP.) 

Using techniques described by Bakeman and Gottman (1986) and Sackett (1979), SEARCH exammes 
LAG output and identifies significant sequences of events. Any number of brandling sequences, each up to 
30 events in length* can be identified. Sample output is illustrated in Figure 4. SEARCH wiU automatically 
generate required LAG output The level of alpha required for significance of die individual LAG analyses is 
user-specified. The significance of die sequence as a whole is also assessed. Maximum sequence kngth can be 
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specified. Both LAG and SEARCH penuit beiiavior codes to be combined or data to be pooled across subjects. 

Insert Figure 4 about here 



Data management 

Three utility programs are also included. SCAN reads FOCAL data files and reports summary statistics 
(total trials, events, and time observed) and checks for cocumon errors that arise when data arc edited. Thus this 
program is useful for tracking the work accomplished by various members of a research team. COMPARE can 
be used to examine back-up and current copies of files. Discrepancies arc displayed in context (Le., with 
surrounding lines). EDIT is a stand-alone versjon of die Word-Perfect-like text processor incorporated into 
FOCAL. It can be used to edit data or note files. 
Qeneral features 

Programs can be run from menus or the DOS prompt. Muldple analyses can be run &om batch files. 
Because data and output files are DOS Text files, they can be read by standard word processing programs and 
statistical packages such as BMDP. 

Technical support is available from the author without charge. 
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Figure 1: a sample data file. 

Family 41, Home Session 1. 22 JAN 1990 at 17:06 Trial I, Focal person= 3, Observers Paula 

2 20 3 3.9 4.0 

3 20 2 1.4 53 

2 20 3 1.2 6.5 

3 20 2 3.5 lOO 
3 0;i 3 9.6 19,6 
2 2(1 3 9.0 28.6 

2 13 3 3.4 31.9 

3 0-*: 3 2.6 34.5 
3 01 3 4.6 39.1 
3 20 2 1.7 40.8 

2 20 3 ■ 4.6 45.4 

3 78 2 5.4 50.8 
3 01 3 11.4 62.1 
3 05 3 16.0 78.1 
3 01 3 2.0 80.1 
3 78 9 2.4 825 
9 27 3 5.8 88.3 
3 01 3 40.0 128.3 
-I 

Family 41, Home Session 1. 22 JAN 1990 at 17:19 Trial 2, Focal peison= 2, Observers Paula 

2 3 10.3 10.3 

2 22 4 8.0 18.3 

2 20 4 10.0 28.3 

3 20 4 5.6 33.9 

2 20 3 3.4 37.3 

3 20 2 10.2 47.5 
2 78 3 14 49.9 
2 20 3 5.1 54,9 

2 22 4 3.4 58.3 

3 20 2 3.3 61.6 

2 20 3 15 64.1 

3 20 2 10 66.0 

2 20 3 6.1 72.1 

3 20 2 1.3 73.3 

2 20 3 4.6 77.9 

3 20 2 15 80.4 
2 20 3 10.2 90.6 
2 22 4 20.3 110.9 
2 20 4 3.6 114.5 

4 22 2 7.7 122.2 
-I 
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Figure 2. Reliabilities: sample output £rom KAPPA 

Reliabilities: Per cent agreement and kappa 
Tuesday, March 16, 1993 12:40 p-m. 

Data from file C;\RESEARCH\TORONTO\RELIABLE\LISA104 .DAT 

collected 17 NOV 1988. Coder:= Lisa 
Data from file C:\RESEARCH\TORONTO\RELIABLE\TALLY104, DAT 

collected 18 NOV 1988. Coder= tally 

Reliabilities calculated over 22.6 minutes across 3 trials 

Permitted error in timing between sequences: 10 seconds 
Permitted error when identifying sequences: 10 seconds 
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Total agreements / total comparisonss 93 / 108 

Overall proportion of agreements .861 

Kappa= .825 

Standard Error of Kappa=s .046 

Approximate 95% confidence interval^ .733 to .917 
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Figure 3. Matching sequencess and events in two reliability files. 

Session 2. 18 NOV 1988 at 9:54. Trial 1, Focal person 3. 
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Agreements / comparisons^ 45 / 52. Percent Agreements: 86.5 
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Agreemencs / comparisons= 45/52. 



1988 at 9:55. Trial 3, Focal person 3. 



Percent Agreement^ 86.5 
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Figure 4. Patterns of behavior: output from SEARCH 

Data read from file C; \STATS\918 .DAT. 
Required alphas .025 

Saturday, August 22, 1992 11:33 a.m. 

Sequence 1 of 8 

This sequence followed the criterion 7 of 93 times (.075); 
expected probability^ .006, z= 8.840, binomial p= .0000 

Lag Response frequency lag p Z liinomial p Stability index 
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(Sequence 2 omitted) 
Sequence 3 of 8 

This sequence followed the criterion 13 of 93 times (.140); 
expected probability^ .013, z» 11.006, binomi.:?l p= .0000 

Lag Response frequency lag p 2 p Stability index 



0 1 50 2 

1 1 30 2 47 

2 1 10 2 29 

3 1 11 2 66 

4 1 20 2 22 

5 1 21 2 83 

6 1 31 2 29 

7 1 40 2 73 

8 1 41 2 88 

[Sequences 4 to 8 omitted.] 

Notes: Under the null hypothesis, the expected probability for observing a 
sequence is the product of its lag 1 probabilities; this value is reported 
above. The z. score is the familiar ^ test for a , proportion . Sequence 3 
diverges from Sequence 1 for lags 1 to 3 , then rejoins at lag 4. 
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